Computation circuit for performing vector-matrix multiplication and semiconductor device including the computation circuit

ABSTRACT

A computation circuit includes a computing cell array configured to provide a plurality of physical values respectively corresponding to a plurality of elements of a matrix; a vector input circuit configured to provide a plurality of input voltages corresponding to an input vector to the computing cell array; and a vector output circuit configured to output a plurality of output voltages each corresponding to a dot product between the input vector and a column vector of the matrix according to the plurality of input voltages and the plurality of effective capacitances.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application claims priority under 35 U.S.C. § 119(a) to Korean Patent Application No. 10-2019-0077865, filed on Jun. 28, 2019, which is incorporated herein by reference in its entirety.

BACKGROUND 1. Technical Field

Various embodiments may generally relate to a computation circuit for performing a vector-matrix multiplication and a semiconductor device including the computation circuit.

2. Related Art

Neural networks are widely used in artificial intelligence applications, such as image recognition and technologies used in autonomous vehicles.

FIG. 1 is a block diagram illustrating a structure of a neural network.

A neural network includes an input layer, an output layer, and one or more inner layers between the input layer and the output layer.

In FIG. 1, there are a first inner layer and a second inner layer between the input layer and the output layer.

Each of the output layer, the input layer, and the inner layers includes one or more neurons. Neurons contained in adjacent layers are coupled in various ways through synapses. For example, synapses point from neurons in a given layer to neurons in a next layer. Alternately or additionally, synapses point to neurons in a given layer from neurons in a previous layer.

Each of the neurons stores a value. The values of the neurons included in the input layer are determined according to an input signal, for example, an image to be recognized. The values of the neurons contained in the inner and output layers are based on the neurons and synapses contained in corresponding previous layers. For example, the values of the neurons in each of the inner layers are based on the values of the neurons in a preceding layer in the neural network.

Each of the synapses has a weight. The weight of each of the synapses is based on a training operation of the neural network.

FIG. 2 is a block diagram illustrating a computation operation to determine values of neurons in the second inner layer.

Neurons of the first inner layer is represented with a vector with 4 elements [x₁ x₂ x₃ x₄], neurons of the second inner layer is represented with a vector with 3 elements [h₁ h₂ h₃], and synapses between the first inner layer and the second inner layer is represented with a matrix having 4 rows and 3 columns, wherein a synapse coupling a first element of the second inner layer to a first element of the first inner layer has a weight W₁₁, a synapse coupling a second element of the second inner layer to a first element of the first inner layer has a weight W₂₁, a synapse coupling a second element of the second inner layer to a fourth element of the first inner layer has a weight W₂₄, a synapse coupling a third element of the second inner layer to a fourth element of the first inner layer has a weight W₃₄, and so on. Accordingly, the first element of the second inner layer h₁ will have a value equal to x₁·W₁₁+x₂·W₁₂+x₃·W₁₃+x₄·W₁₄, the second element of the second inner layer h₂ will have a value equal to x₁·W₂₁+x₂·W₂₂+x₃·W₂₃+x₄·W₂₄, and the third element of the second inner layer h₃ will have a value equal to x₁·W₃₁+x₂·W₃₂·x₃·W₃₃+x₄·W₃₄.

As described above, in neural network operations, multiplication operations between a vector and a matrix are frequently performed, and importance of a computation circuit capable of performing the operations efficiently is increasing.

SUMMARY

In accordance with the present teachings, a computation circuit may include a computing cell array configured to provide a plurality of physical values respectively corresponding to a plurality of elements of a matrix; a vector input circuit configured to provide a plurality of input voltages corresponding to an input vector to the computing cell array; and a vector output circuit configured to output a plurality of output voltages each corresponding to a dot product between the input vector and a column vector of the matrix according to the plurality of input voltages and the plurality of effective capacitances.

In accordance with the present teachings, a semiconductor device may include a command decoder configured to receive a command and an address; a data input/output (IO) buffer configured to input or output data according to a control of the command decoder; and a computation circuit configured to generate a plurality of output voltages corresponding to product of an input vector provided from the data IO buffer and a matrix according to a control of the command decoder.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying figures, where like reference numerals refer to identical or functionally similar elements throughout the separate views, together with the detailed description below, are incorporated in and form part of the specification, and serve to further illustrate embodiments of concepts that include the claimed novelty, and explain various principles and advantages of those embodiments.

FIG. 1 is a block diagram illustrating a neural network.

FIG. 2 is a block diagram illustrating an operation performed in the neural network.

FIG. 3 is a block diagram illustrating a computation circuit according to an embodiment of the present disclosure.

FIG. 4 is a block diagram illustrating a vector input circuit according to an embodiment of the present disclosure.

FIG. 5 is a block diagram illustrating a computing cell array according to an embodiment of the present disclosure.

FIG. 6 is a circuit diagram illustrating a computing cell according to an embodiment of the present disclosure.

FIG. 7 is a block diagram illustrating a matrix control circuit according to an embodiment of the present disclosure.

FIG. 8 is a block diagram illustrating a vector output circuit according to an embodiment of the present disclosure.

FIG. 9 is a circuit diagram illustrating an element output circuit according to an embodiment of the present disclosure.

FIG. 10 is a block diagram illustrating a semiconductor device according to an embodiment of the present disclosure.

DETAILED DESCRIPTION

The following detailed description references the accompanying figures in describing illustrative embodiments consistent with this disclosure. The embodiments are provided for illustrative purposes and are not exhaustive. Additional embodiments not explicitly illustrated or described are possible. Further, modifications can be made to presented embodiments within the scope of the present teachings. The detailed description is not meant to limit this disclosure. Rather, the scope of the present disclosure is defined in accordance with the presented claims and equivalents thereof.

FIG. 3 is a block diagram illustrating a computation circuit 1 according to an embodiment of the present disclosure.

The computation circuit 1 includes a vector input circuit 100, a computing cell array 200, a matrix control circuit 300, a vector output circuit 400, and a computing control circuit 500.

The computing control circuit 500 controls each component of the computation circuit 1 to perform a program operation or a computation operation.

For example, the computing control circuit 500 may provide an input switch control signal S_(1m) (1≤m≤i) to the vector input circuit 100 to control a vector input.

In addition, the computing control circuit 500 provides an operation mode signal MODE to the matrix control circuit 300, and provides element data DC_(mn) and index signals m and n of the matrix so that data corresponding to each element of the matrix may be stored.

In addition, the computing control circuit 500 may provide the vector output circuit 400 with a sampling switch control signal S_(2n) (1≤n≤j) and a conversion switch control signal S_(3n) (1≤n≤j) to output the computation result.

Hereinafter, the detailed configuration and operation of the computation circuit 1 will be described with reference to FIGS. 4 to 9.

FIG. 4 is a block diagram illustrating a vector input circuit 100 according to an embodiment of the present disclosure.

The vector input circuit 100 provides a computing cell array 200 with input voltages V_(x1), V_(x2), . . . , V_(xi), where i is a natural number equal to or greater than 2, corresponding to a plurality of elements included in the input vector.

The vector input circuit 100 according to an embodiment of the present disclosure includes input switches 110 ₁, 110 ₂, . . . 110 _(i) corresponding to the number of elements of the input vector.

The input switches 110 ₁, 110 ₂, . . . 110 _(i) respectively provide analog input voltages V_(x1), V_(x2), . . . V_(xi) to the computing cell array 200 in accordance with respective input switch control signals S₁₁, S₁₂, . . . S_(1i).

The vector input circuit 100 may further include a plurality of input buffers 120 ₁, 120 ₂, . . . 120 _(i) which may respectively buffer the analog input voltages V_(x1), V_(x2), . . . V_(xi).

In FIG. 4, an embodiment in which an output signal of the input buffer 120 _(m) is provided to the input switch 110 _(m) is illustrated, where 1≤m≤i, but in another embodiment, the connection order may be changed so that the input switch 110 _(m) may be coupled to the input terminal of the input buffer 120 _(m).

FIG. 5 is a block diagram illustrating a computing cell array 200 according to an embodiment of the present disclosure.

The computing cell array 200 includes a plurality of computing cells 210 ₁₁, 210 ₁₂, . . . 210 _(ij), 210 ₂₁, 210 ₂₂, . . . 210 _(2j), . . . , 210 _(i1), 210 _(i2), . . . 210 _(ij), (collectively referred to herein as the plurality of computing cells 210) where i and j are natural numbers greater than 1.

The plurality of computing cells 210 may be arranged in i rows and j columns so that they may correspond to the elements of the matrix which the input vector will be multiplied by to produce the output vector.

The computing cell array 200 includes a plurality of plate lines 220 ₁, 220 ₂, . . . 220 _(i) (collectively, plate lines 220), a plurality of first word lines 230 ₁, 230 ₂, . . . 230 _(i) (collectively, first word lines 230), a plurality of bit lines 240 ₁, 240 ₂, . . . 240 _(j) (collectively, bit lines 240), and a plurality of control lines 250 ₁, 250 ₂, . . . 250 _(j) (collectively, control lines 250).

In the present embodiment, the plate lines 220 and the first word lines 230 are arranged in parallel to each other in the row direction, the bit lines 240 and the control lines 250 are arranged in parallel to each other in the column direction, and the plates line 220 and the bit lines 240 are arranged to be perpendicular to each other.

Each computing cell 210 _(mn) intersects with the plate line 220 _(m), the first word line 230 _(m), the bit line 240 _(n), and the control line 250 _(n), for n in 1 . . . j and m in 1 . . . i.

Although FIG. 5 illustrates one first word line per row, a plurality of second word lines may be further comprised in each row. This will be described in detail with reference to FIG. 6.

The input voltage V_(xm) corresponding to an element of the input vector is input to each plate line 220 _(m).

The bit line 240 _(n) is commonly coupled to computing cells of the n-th column, for n in 1 . . . j.

The computing cell 210 _(mn) outputs a signal corresponding to the product of m-th element of the input vector (V_(xm), which is input on the plate line 220 _(m)) and (m,n)-th element of the matrix (which value is stored in computing cell 210 _(mn)) to the bit line 240 _(n), for n in 1 . . . j and m in 1 . . . i.

The signal Q_(n) output from the bit line 240 _(n) corresponds to the dot product of the input vector and n-th column vector of the matrix; that is, to the sum of the products output to the bit line 240 _(n) by computing cells 210 _(1n), 210 _(2n), . . . 210 _(in), where 1≤n≤j.

The column selection signal V_(sn) is input to the control line 250 _(n).

The column selection signal V_(sn) may be used to program the corresponding elements of the matrix into computing cells of the n-th column of the computing cell array 200, where 1≤n≤j.

A first word line voltage V_(wm) is applied to the first word line 230 _(m) to program the (m, n)-th element of the matrix to the computing cell 210 _(mn).

A detailed structure of the computing cell 210 _(mn) and an operation related thereto will be described in detail with reference to FIG. 6.

FIG. 6 is a circuit diagram illustrating a computing cell 210 _(mn) according to an embodiment of the present disclosure.

The computing cell 210 _(mn) includes one or more capacitors C_(mn,1), C_(mn,2), . . . C_(mn,k) and one or more NMOS transistors N_(mn,1), N_(mn,2), . . . N_(mn,k), where k is a natural numbers corresponding to a number of bits used to store a value in the computing cell 210 _(mn).

The capacitor C_(mn,p) and the NMOS transistor N_(mn,p), p in 1 . . . k, are coupled in a form similar to a Dynamic Random Access Memory (DRAM) cell (except that the terminal of the capacitor C_(mn,p) not coupled to the NMOS transistor N_(mn,p) is coupled to the plate line 220 _(m) instead of to ground).

That is, one node of the capacitor C_(mn,p) is coupled to the plate line 220 _(m), and the source and drain of the NMOS transistor N_(mn,p) are coupled between the other node of the capacitor C_(mn,p) and the bit line 240 _(n).

The gate of the NMOS transistor N_(mn,p) is coupled to the drain of the floating gate transistor F_(mn,p) among one or more floating gate transistors F_(mn,1), F_(mn,2), . . . F_(mn,k).

The sources of the floating gate transistors F_(mn,1), F_(mn,2), . . . F_(mn,k) are commonly coupled to the source of the switching transistor M_(mn).

The drain of the switching transistor M_(mn) is coupled to the first word line 230 _(m) and the gate of the switching transistor M_(mn) is coupled to the control line 250 _(n).

The control gate of the floating gate transistor F_(mn,p) is coupled to the drain of the switching transistor M_(mn,p) of the one or more switching transistors M_(mn,1), M_(mn,2), . . . M_(mn,k).

The gate of the switching transistor M_(mn,p) is coupled to the control line 250 _(n) and the source is coupled to the second word line 232 _(m,p) of the one or more second word lines 232 _(m,1), . . . 232 _(m,1), . . . 232 _(m,k).

As such, the first word line 230 _(m) and the plurality of second word lines 232 _(m,1), . . . , 232 _(m,k) are coupled to a computing cell 210 _(mn).

In the present embodiment, the value of the (m, n)-th element of the matrix corresponds to a physical value of the computing cell 210 _(mn) such as the effective capacitance between the plate line 220 _(m) and the bit line 240 _(n) in the computing cell 210 _(mn).

In this embodiment, the effective capacitance between the plate line 220 _(m) and the bit line 240 _(n) is determined by which capacitors are coupled between the plate line 220 _(m) and the bit line 240 _(n) among the plurality of capacitors C_(mn,1) . . . C_(mn,k).

The floating gate transistor F_(mn,p) is configured to control the gate voltage of the NMOS transistor N_(mn,p) by injecting charges into the floating gate during a program operation or removing charges from the floating gate using an erase operation.

The program operation and the erase operation performed on the computing cell 210 _(mn) may be controlled by the matrix control circuit 300.

During the program operation, a high level signal is applied to the control line 250 _(n) to turn on the switching transistors M_(mn), M_(mn,1), . . . , M_(mn,k).

Accordingly, the first word line 230 _(m) is coupled to the source of the floating gate transistors F_(mn,p), and the second word line 232 _(m,p) is coupled to the control gate of the floating gate transistor F_(mn,p).

An operation for injecting or removing charges into the floating gate of the floating gate transistor F_(mn,p) (that is, for programming or erasing the floating gate transistor F_(mn,p)) is well known in the art.

That is, electrons are injected into the floating gate of the floating gate transistor F_(mn,p) by applying a positive high voltage to the second word line 232 _(m,p) while the first word line 230 _(m) is grounded. Electrons may be removed from the floating gate of the floating gate transistor F_(mn,p) by applying a negative high voltage to the second word line 232 _(m,p).

When electrons are injected into the floating gate, the threshold voltage of the floating gate transistor is increased. Accordingly, a voltage may be applied to the control gate of the floating gate transistor that will turn the floating gate transistor on if electrons have not been injected into the floating gate, but will not turn the floating gate transistor on if a sufficient number of electrons have been injected into the floating gate.

Computing operations may also be controlled by the matrix control circuit 300.

The computing operation determines whether the NMOS transistor N_(mn,p) is on or off by adjusting the gate voltage of the NMOS transistor N_(mn,p) according to the result of programming the floating gate transistor F_(mn,p), and accordingly determines whether one or more of the capacitors C_(mn,1) . . . C_(mn,k) are conductively coupled between the plate line 220 _(m) and the bit line 240 _(n).

In computing operation, a high level signal is applied to the control line 250 _(n) to turn on the switching transistors M_(mn), M_(mn,1), . . . , M_(mn,k).

The power supply voltage VDD is applied to the first word line 230 _(m) and second word lines 232 _(m,1), . . . , 232 _(m,k) during the computing operation.

Accordingly, the floating gate transistor in which electrons are injected into the floating gate thereof is not turned on, which results in the corresponding NMOS transistor being not turned on, and the floating gate transistor in which electrons are not injected into the floating gate thereof is turned on, which results in the corresponding NMOS transistor being turned on to conductively couple the corresponding capacitor between a corresponding plate line and a corresponding bit line.

The sum of capacitances of the capacitors corresponding to the floating gate transistors in which no electrons are injected into the floating gates thereof in the computing cell 210 _(mn), that is, the effective capacitance C_(mn) of the computing cell 210 _(mn), corresponds to the value of (m, n)-th element of the matrix.

The effective capacitance may be variously determined by increasing the number of capacitors included in the computing cell 210 _(mn). The capacitors included in the computing cell 210 _(mn) may be set to have an identical capacitance or different capacitances. These can be changed according to embodiments. For example, in an embodiment each capacitor C_(mn,p) has a capacitance equal to c*2^((p-1)) for p in 1 . . . k, where c is a constant, so that the effective capacitance C_(mn) may be determined by programming a binary number into the floating gate transistors F_(mn,1) . . . F_(mn,k). In another embodiment, some of the floating gate transistors F_(mn,1) . . . F_(mn,k) may be coupled to more than one transistor-capacitor chain, such that some of the floating gate transistors F_(mn,1) . . . F_(mn,k) respectively control whether more than one capacitor is conductively coupled between a corresponding plate line and a corresponding bit line. For example, each floating gate transistors F_(mn,p), for p in 1 . . . k, may control 2^((p-1)) capacitors through associated transistors, so that the effective capacitance C_(mn) may be determined by programming a binary number into the floating gate transistors F_(mn,1) . . . F_(mn,k).

The computing result in each computing cell 210 _(mn) corresponds to the product of the input voltage V_(xm) input to the plate line 220 _(m) and the effective capacitance C_(mn), which corresponds to the amount of charge charged in the capacitor.

Since the bit line 240 _(n) and the control line 250 _(n) are common to a plurality of computing cells in the same column, the program operation and the computing operation may be performed on a per-column basis. In an embodiment performing a matrix multiplication operation, the computing operation for each column may be performed simultaneously with the computing operation of the other columns.

In addition, since the operation is performed on a column basis, a positive charge corresponding to the dot product of the input vector and the n-th column vector of the matrix may be output from the n-th bit line 240 _(n). That is, each computing cell 210 _(mn), for m in 1 . . . i, coupled to the n-th bit line 240 _(n) and programmed to have an effective capacitance corresponding to the matrix value W_(mn) contributes a charge corresponding to a voltage value of the plate line 220 _(m) to which it is coupled multiplied by its effective capacitance. The contributed charge of the computing cell 210 _(mn) therefor corresponding to x_(m)·W_(mn). The total contribution of charge to the bit line 240 _(n) corresponds to the sum of the charges produced by the computing cells coupled thereto: x₁·W_(1n)+x₂·W_(2n)+ . . . x_(i)·W_(in).

FIG. 7 is a block diagram illustrating a matrix control circuit 300 according to an embodiment of the present disclosure.

The matrix control circuit 300 programs the floating gate transistors as described above with respect to the plurality of computing cells included in the computing cell array 200 to determine an effective capacitance corresponding to each computing cell.

The matrix control circuit 300 provides a plurality of column selection signals V_(sn) to select columns for performing program operations and computing operations, and provides first word line voltages V_(wm) and second word line voltages V_(wm,p) to provide values for programming the floating gate transistors of the computing cells.

The matrix control circuit 300 may perform a program operation or a computing operation by receiving an operation mode signal MODE. The matrix control circuit 300 may receive element data DC_(mn) corresponding to each computing cell and determine a first word line voltage V_(wm) and the second word line voltages V_(wm,p) during a program operation.

Determining which of the NMOS transistor N_(mn,p) are to be turned on according to the value of the element data DC_(mn) may be performed in variously ways by those skilled in the art, and correspondingly, the values of the first word line voltage V_(wm) and second word line voltages V_(wm,p) to be provided in the program operation may be easily determined by a person skilled in the art from the above-described disclosure, and thus a detailed description thereof will be omitted.

Whether an n^(th) column performs a program operation or a computing operation may be determined by adjusting the level of the column selection signal V_(sn).

The computing control circuit 500 may control the matrix control circuit 300 to perform a program operation or a computing operation.

FIG. 8 is a block diagram illustrating a vector output circuit 400 according to an embodiment of the present disclosure.

The vector output circuit 400 includes a plurality of element output circuits 410 ₁, 410 ₂, . . . 410 _(j).

As described above, the charge amount Q_(n) output from the bit line 240 _(n) corresponds to the dot product of the input vector and the n-th column vector of the matrix.

The element output circuits 410 _(n), n in 1 . . . j, converts the charge amount Q_(n) into an output voltage V_(hn) corresponding to the dot product of the input vector and the n-th column vector of the matrix and outputs the output voltage V_(hn).

FIG. 9 is a circuit diagram illustrating an element output circuit 410 _(n) according to an embodiment of the present disclosure.

The element output circuit 410 _(n) includes a sampling capacitor C_(sn), a sampling switch 411 _(n), a conversion switch 412 _(n), and a conversion circuit 413 _(n).

In this embodiment, the sampling switch 411 _(n) is controlled according to the sampling switch control signal S_(2n), and the conversion switch 412 _(n) is controlled according to the conversion switch control signal S_(3n).

In the present embodiment, the sampling switch 411 _(n) is turned off and then turned on after the input switch 110 _(m) of FIG. 4 is turned on to charge the capacitors C_(mn, 1), . . . , C_(mn, k) of FIG. 6.

Accordingly, the charge Q_(n) transferred through the bit line 240 _(n) is transferred to the sampling capacitor C_(sn).

After the sampling switch 411 _(n) is turned off and the conversion switch 412 _(n) is turned on, the conversion circuit 413 _(n) generates an output voltage V_(hn) corresponding to the charge charged in the sampling capacitor C_(sn).

In the present embodiment, the conversion circuit 413 _(n) is an amplifier circuit including an operational amplifier 414 _(n) and a feedback capacitor C_(fn) and generates an output voltage V_(hn) corresponding to the capacitance of the sampling capacitor C_(sn) and the feedback capacitor C_(fn) and buffers the output voltage V_(hn).

As described above, the element output circuit 410 _(n) including the conversion circuit 413 _(n) adjusts the signal such that the output voltage V_(hn) has a value corresponding to the dot product of the input vector and the n-th column vector of the matrix.

FIG. 10 is a block diagram illustrating a semiconductor device 1000 according to an embodiment of the present disclosure.

The semiconductor device 1000 according to the embodiment of the present disclosure includes a structure similar to that of a semiconductor memory device such as a DRAM.

The semiconductor device 1000 includes a command decoder 2 and a data input/output (IO) buffer 3.

The semiconductor device 1000 according to an embodiment of the present disclosure may further include a computation circuit 1, a digital to analog converter (DAC) 4, an element buffer 5, and an analog to digital converter (ADC) 6.

Since the configuration and operation of the computation circuit 1 are as described above, detailed description thereof is omitted.

The command decoder 2 controls the computation circuit 1, the data IO buffer 3, the DAC 4, the element buffer 5, and the ADC 6 according to the command and the address.

The command decoder 2 receives a command and an address from the memory controller.

In this case, the command may include a read command, a write command (a program command), a computation command, and etc.

Write commands include a vector write command for writing an input vector and a matrix write command for writing a matrix.

Input data D_(xm), for m in 1 . . . i, corresponding to elements included in the input vector may be provided to the data IO buffer 3 by executing vector write commands.

In this case, the address corresponding to the vector write command may include information on the element number m of the input vector.

In the present embodiment, the input data D_(xm) is a digital signal and the command decoder 2 controls the DAC 4 to generate an input voltage V_(xm) corresponding to the input data D_(xm) and to provide the input voltage V_(xm) to the computation circuit 1 when a vector write command is provided. In an embodiment, the DAC 4 produces a plurality of input voltages V_(xm) each corresponding to a respective input data D_(xm), for m in 1 . . . i.

The vector input circuit 100 may maintain the input voltages V_(xm) using the buffer 120 _(m).

By executing a matrix write command, element data DC_(mn) corresponding to an element included in the matrix may be provided to the element buffer 5.

In this case, the address corresponding to the matrix write command may include a row number and a column number corresponding to the element of the matrix.

The computation circuit 1 programs a plurality of floating gate transistors of a computing cell 210 _(mn) included in the computing cell array 200 according to the element data DC_(mn) provided from the element buffer 5 and the commands and addresses provided from the command decoder 2.

The computing command indicates that a multiplication of the matrix and the input vector currently input is to be performed.

Accordingly, the command decoder 2 controls the computation circuit 1 to perform computing operations.

In the computing operation, when the column select signal V_(sn) and the first and second word line voltages V_(wm) and V_(wm, p) are applied, depending on whether charge has been injected into the respective floating gates of the floating gate transistors of the computing cell 210 _(mn), effective capacitance C_(mn) of the computing cell 210 _(mn) is determined.

For the computing operation, the input switch 110 _(m) of FIG. 4 is turned on while the sampling switch 411 _(n) of FIG. 9 is turned off to charge the capacitor C_(mn,p) of the computing cell 210 _(mn) according to the input voltage V_(xm).

Thereafter, the sampling switch 411 _(n) is turned on to charge the sampling capacitor C_(sn) according to the charge Q_(n) provided from the bit line 240 _(n).

Thereafter, the sampling switch 411 n is turned off and the conversion switch 412 _(n) is turned on to generate the output voltage V_(hn).

The read command corresponds to an operation of outputting an output voltage V_(hn) corresponding to each element of the output vector.

In the read command, the corresponding address may indicate an element number of the output vector to be read.

When a read command is provided, the output voltage V_(hn) corresponding to the element number is provided by the computation circuit 1 and converted into the output data D_(hn) by the ADC 6, and the converted data is output through the data IO buffer 3.

Although various embodiments have been described for illustrative purposes, it will be apparent to those skilled in the art that various changes and modifications may be made to the described embodiments without departing from the spirit and scope of the disclosure as defined by the following claims. 

What is claimed is:
 1. A computation circuit comprising: a computing cell array configured to provide a plurality of effective capacitances respectively corresponding to a plurality of elements of a matrix; a vector input circuit configured to provide a plurality of input voltages corresponding to an input vector to the computing cell array; and a vector output circuit configured to output a plurality of output voltages each corresponding to a dot product between the input vector and a column vector of the matrix according to the plurality of input voltages and the plurality of effective capacitances, wherein the computing cell array comprises: a plurality of computing cells; a plurality of plate lines; and a plurality of bit lines, wherein each of the plurality of computing cells is located where a respective one of the plurality of plate lines and a respective one of the plurality of the bit lines cross, wherein each of the plurality of computing cells includes a plurality of capacitors selectively coupled between the respective one of the plurality of plate lines and the respective one of the plurality of bit lines; and wherein an effective capacitance between the respective one of the plurality of plate lines and the respective one of the plurality of bit lines respectively corresponds to a value of an element included in the matrix.
 2. The computation circuit of claim 1, wherein each of the plurality of computing cells includes: a plurality of MOS transistors configured to selectively couple the plurality of capacitors with the respective one of the plurality of plate lines; and a plurality of nonvolatile transistors configured to respectively determine on or off states of the plurality of MOS transistors.
 3. The computation circuit of claim 2, wherein the computing cell array further comprises a plurality of first word lines corresponding to a number of rows of the matrix, and wherein each of the plurality of first word lines is coupled to a gate of a corresponding one of the plurality of MOS transistors through a corresponding one of the plurality of nonvolatile transistors.
 4. The computation circuit of claim 3, wherein the computing cell array further comprises a plurality of second word lines, and wherein a gate of each of the plurality of nonvolatile transistors is coupled to one of the plurality of second word lines, and wherein the computing cell array is configured to program each of the plurality of nonvolatile transistors according to a voltage of one of the plurality of second word lines.
 5. The computation circuit of claim 4, wherein the computing cell array further comprises a plurality of control lines, and wherein each of the plurality of computing cells further includes: a transistor configured to selectively couple one of the plurality of first word lines to the plurality of nonvolatile transistors of the computing cell according to one of the plurality of control lines, and a plurality transistors respectively selectively coupling gates of the plurality of nonvolatile transistors of the computing cell to respective ones of the plurality of second word lines according to the one of the plurality of control lines.
 6. The computation circuit of claim 1, wherein the vector output circuit includes a plurality of element output circuits each configured to generate a respective output voltage among the plurality of output voltages according to a signal provided from a respective one of the plurality of bit lines.
 7. The computation circuit of claim 6, wherein each of the plurality of element output circuits includes: a sampling capacitor coupled to the respective one of the plurality of bit lines; and a conversion circuit configured to generate the respective output voltage according to charges charged in the sampling capacitor.
 8. The computation circuit of claim 7, wherein each of the plurality of element output circuits further includes: a sampling switch configured to selectively couple the respective one of the plurality of bit lines and the sampling capacitor; and a conversion switch configured to selectively provide charges charged in the sampling capacitor to the conversion circuit.
 9. The computation circuit of claim 1, wherein the vector input circuits includes: a plurality of input buffers configured to respectively buffer the plurality of input voltages; and a plurality of input switches configured to selectively provide output voltages of the plurality of input buffers to the computing cell array.
 10. The computation circuit of claim 1, further comprising a matrix control circuit configured to set effective capacitances corresponding to each element of the matrix in the computing cell array.
 11. A semiconductor device comprising: a command decoder configured to receive a command and an address; a data input/output (IO) buffer configured to input or output data according to a control of the command decoder; and a computation circuit configured to generate a plurality of output voltages corresponding to product of an input vector provided from the data IO buffer and a matrix according to a control of the command decoder, wherein the computation circuit comprises: a computing cell array configured to set and store a plurality of effective capacitances respectively corresponding to the elements of the matrix; a vector input circuit configured to provide a plurality of input voltages corresponding to an input vector to the computing cell array; and a vector output circuit configured to output a plurality of output voltages each corresponding to a dot product between the input vector and a column vector of the matrix according to the plurality of input voltages and the plurality of effective capacitances, wherein the computing cell array comprises: a plurality of computing cells; a plurality of plate lines; and a plurality of bit lines, wherein each of the plurality of computing cells is located where one of the plurality of plate lines and one of the plurality of the bit lines cross, wherein each of the plurality of computing cells includes a plurality of capacitors selectively coupled between one of the plurality of plate lines and one of the plurality of bit lines; and wherein an effective capacitance between one of the plurality of plate lines and one of the plurality of bit lines corresponds to a value of an element included in the matrix.
 12. The semiconductor device of claim 11, further comprising a digital to analog converter (DAC) configured to convert a data of the data IO buffer into an input voltage corresponding to an element of the input vector and to provide the input voltage with the computation circuit according to a control of the command decoder.
 13. The semiconductor device of claim 11, further comprising an analog to digital converter (ADC) configured to convert the plurality of output voltages into a plurality of output data and provide the output data with the data IO buffer according to a control of the command decoder.
 14. The semiconductor device of claim 11, further comprising an element buffer configured to receive element data corresponding to an element of the matrix from the data IO buffer and to buffer the element data according to a control of the command decoder.
 15. The semiconductor device of claim 11, wherein each of the plurality of computing cells includes: a plurality of MOS transistors configured to selectively couple, respectively, the plurality of capacitors with the one of the plurality of plate lines; and a plurality of nonvolatile transistors configured to respectively determine an on or off state of the plurality of MOS transistors.
 16. The semiconductor device of claim 11, wherein the vector output circuit comprises a plurality of element output circuits, each of the plurality of element output circuits including: a sampling switch configured to selectively couple a respective one of the plurality of bit lines and a sampling capacitor; and a conversion switch configured to selectively provide charges charged in the sampling capacitor to a conversion circuit.
 17. The semiconductor device of claim 11, wherein the vector input circuits includes: a plurality of input buffers configured to respectively buffer the plurality of input voltages; and a plurality of input switches configured to selectively provide voltages of the plurality of input buffers, respectively, to the computing cell array. 